The Blizzard Challenge 2006 CMU Entry introducing hybrid trajectory-selection synthesis

نویسندگان

  • John Kominek
  • Alan W Black
چکیده

Acknowledging the lessons of Blizzard Challenge 2005 – that smooth prosodic cadence supersedes spectral resolution – but wanting a system devoid of vocoding artifacts – we introduce a hybrid trajectory-selection synthesizer. Using a parametric synthesizer to generate a pitch-synchronous sequence of F0/duration/power and spectral vectors, this trajectory serves as the target cost function for a unit selection synthesizer. The combination can unify the best attributes of two distinct categories of synthesizers, provided that the feature representation supports both. To this end, we also introduce a new perceptually-weighted harmonic representation of speech that is pitch-synchronous and retains phase information.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CMU Blizzard 2007: A Hybrid Acoustic Unit Selection System from Statistically Predicted Parameters

This paper describes CMU’s entry for the Blizzard Challenge 2007. Our eventual system consisted of a hybrid statistical parameter generation system whose output was used to do acoustic unit selection. After testing a number of varied systems, this system proved the best in our internal tests. This paper also explains some of the limitations we see in our techniques. The CMU system is identified...

متن کامل

The blizzard challenge 2005 CMU entry - a method for improving speech synthesis systems

In CMU's Blizzard Challenge 2005 entry we investigated twelve ideas for improving Festival-based unit selection voices. We tracked progress by adopting a 3-tiered strategy in which candidate ideas must pass through three stages of listening tests to warrant inclusion in the final build. This allowed us to evaluate ideas consistently without us having large human resources at our disposal, and t...

متن کامل

The CSTR entry to the Blizzard Challenge 2016

This paper describes the text-to-speech system entered by The Centre for Speech Technology Research into the 2016 Blizzard Challenge. This system is a hybrid synthesis system which uses output from a recurrent neural network to drive a unit selection synthesiser. The annual Blizzard Challenge conducts side-byside testing of a number of speech synthesis systems trained on a common set of speech ...

متن کامل

The CSTR entry to the Blizzard Challenge 2017

The annual Blizzard Challenge conducts side-by-side testing of a number of speech synthesis systems trained on a common set of speech data. Similar to 2016 Blizzard challenge, the task for this year is to train on expressively-read children’s story-books, and to synthesise speech in the same domain. The Challenge therefore presents an opportunity to investigate the effectiveness of several tech...

متن کامل

The Jess Blizzard Challenge 2006 Entry

This paper describes the version of the Jess system that participated in the Blizzard Challenge 2006. The Jess system consists of a suite of software tools for processing text and speech. The largest component of the system is a multi-platform unit selection speech synthesiser that uses Unicode and the International Phonetic Alphabet (IPA). The system has been designed to be modular so that dif...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006